A computationally efficient speech/music discriminator for radio recordings
نویسندگان
چکیده
This paper presents a speech/music discriminator for radio recordings, based on a new and computationally efficient region growing technique, that bears its origins in the field of image segmentation. The proposed scheme operates on a single feature, a variant of the spectral entropy, which is extracted from the audio recording by means of a short-term processing technique. The proposed method has been tested on recordings from radio stations broadcasting over the Internet and, despite its simplicity, has proved to yield performance results comparable to more sophisticated approaches.
منابع مشابه
Speech-Music Discrimination from MPEG-1 Bitstream
This paper describes a proposed algorithm for speech/music discrimination, which works on data directly taken from MPEG encoded bitstream thus avoiding the computationally difficult decoding-encoding process. The method is based on thresholding of features derived from the modulation envelope of the frequency-limited audio signal. The discriminator is tested on more than 2 hours of audio data, ...
متن کاملTowards a Software-radio Enabled Broadcast Media Navigator
New consumer services are transforming the Hertzian broadcast media into a rich source of multimedia content. A Broadcast Media Navigator combining Softare Radio elements with Audio Indexing techniques and digital audio recording is proposed. Results from a PC-based demonstrator implementaing a simple speech/music discriminator are presented.
متن کاملA Speech/music Discriminator -based Audio Browser with a Degree of Certainty Measure
In recent years the field of content-based audio signal classification and retrieval has gained a growing amount of interest among researchers around the world. This paper describes a technique, which is used to automatically discriminate audio signals between speech and music. Our goal was to achieve reliable classification results using computationally inexpensive time-domain features. The cl...
متن کاملA new approach for audio classification and segmentation using Gabor wavelets and Fisher linear discriminator
Rapid increase in the amount of audio data demands an efficient method to automatically segment or classify audio stream based on its content. In this paper, based on the Gabor wavelet features, an audio classification and segmentation method is proposed. This method will first divide an audio stream into clips, each of which contains one-second audio information. Then, each clip is classified ...
متن کاملUnsupervised Feature Learning for Speech and Music Detection in Radio Broadcasts
Detecting speech and music is an elementary step in extracting information from radio broadcasts. Existing solutions either rely on general-purpose audio features, or build on features specifically engineered for the task. Interpreting spectrograms as images, we can apply unsupervised feature learning methods from computer vision instead. In this work, we show that features learned by a mean-co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006